Automatic Speaker Clustering

نویسندگان

  • Hubert Jin
  • Francis Kubala
  • Rich Schwartz
چکیده

This paper presents a fully automatic speaker clustering algorithm, which consists of three components: building a distance matrix based on Gaussian models of the acoustic segments; performing hierarchical clustering on the distance matrix with the prior assumption that consecutive segments should be more likely to come from the same speaker; and selecting the best clustering solution automatically by minimizing the within-cluster dispersion with some penalty against too many clusters. We applied this automatic speaker clustering technique in 1996 Hub4 evaluation, and the results show that it contributed signi cantly to the word error rate (WER) reduction in unsupervised adaptation. From our experiments, the algorithm seldom misclassi es segments from the same speaker into di erent clusters. We used the same clustering procedure for both partitioned evaluation (PE) and unpartitioned evaluation (UE) tests [1]. Experiments also show that this automatic speaker clustering algorithm improves unsupervised adaptation as much as the hand labeled ideal case where the clusters are generated based on true speaker, channel and background condition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Speaker

This paper presents a fully automatic speaker clustering algorithm , which consists of three components: building a distance matrix based on Gaussian models of the acoustic segments; performing hierarchical clustering on the distance matrix with the prior assumption that consecutive segments should be more likely to come from the same speaker; and selecting the best clustering solution automati...

متن کامل

Clustering Algorithm in Automatic Speaker Verification

We propose a new modeling approach in Automatic Speaker Verification A.S.V based on Gaussians Mixtures Models and Maximum a posteriori adaptation MAP. We propose clustering algorithm for intra and inter speaker’s variability in voice module and contribute for Universal Speaker Model design. We compare the traditional approach which uses one specific customer model with the second called Univers...

متن کامل

Rapid speaker adaptation using speaker clustering

This paper examines an approach to speaker adaptation called speaker cluster weighting (SCW) for rapid adaptation in the Jupiter weather information system. SCW extends the ideas of previous speaker cluster techniques by allowing the speaker cluster models (learned from training data) to be adaptively weighted to match the current speaker. We explore strategies for automatic speaker clustering ...

متن کامل

Clustering speakers by their voices

The problem of clustering speakers by their voices is addressed. With the mushrooming of available speech data from television broadcasts to voice mail, automatic systems for archive retrieval, organizing and labeling by speaker are necessary. Clustering conversations by speaker is a solution to all three of the above tasks. Another application for speaker clustering is to group utterances toge...

متن کامل

Automatic speaker clustering from multi-speaker utterances

Blind clustering of multi-person utterances by speaker is complicated by the fact that each utterance has at least two talkers. In the case of a two-person conversation, one can simply split each conversation into its respective speaker halves, but this introduces error which ultimately hurts clustering. We propose a clustering algorithm which is capable of associating each conversation with tw...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997